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Abstract 

In downlink multiuser multiple-input multiple-output (MU-MIMO) systems, users are practically 
heterogeneous in nature. However, most of the existing user scheduling algorithms are designed with an 
implicit assumption that the users are homogeneous. In this paper, we revisit the problem by exploring 
the characteristics of heterogeneous users from a subspace point of view. With an objective of minimizing 
interference non-orthogonality among users, three new angular-based user scheduling criteria that can be 
applied in various user scheduling algorithms are proposed. While the first criterion is heuristically de- 
■ termined by identifying the incapability of largest principal angle to characterize the subspace correlation 

and hence the interference non-orthogonality between users, the second and third ones are derived by 

o ; 

using, respectively, the sum rate capacity bounds with block diagonalization and the change in capacity 
by adding a new user into an existing user subset. Aiming at capturing fairness among heterogeneous 
users while maintaining multiuser diversity gain, two new hybrid user scheduling algorithms are also 

X 

' proposed whose computational complexities are only linearly proportional to the number of users. We 



show by simulations that the effectiveness of our proposed user scheduling criteria and algorithms with 
respect to those commonly used in homogeneous environment. 
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I. Introduction 

Multiuser Multiple-Input Multiple-Output (MU-MIMO) systems have attracted a lot of interest and 
being considered as a promising technology in various beyond-3G standards [1] due to its significant 
throughout improvement with respect to the single-user counterpart and its support of key features such 
as multiuser diversity and user multiplexing [2]. In order to achieve sum rate capacity of MIMO broadcast 
channels, one can apply Dirty Paper Coding (DPC) [3,4] but it comes at the expense of huge computa- 
tional and implementation complexity. Aiming at maximizing the sum rate capacity, a great number of 
low-complexity linear precoding algorithms have been proposed in which Block Diagonalization (BD) 
is one of the popular choices due to its capability of approaching the capacity and its ease in practical 
implementation [5]. 

In an overloaded downlink MU-MIMO system that supports a very large number of users, user schedul- 
ing is necessary as base station (BS) cannot usually serve such a large number of users simultaneously 
because of the following two key reasons. First, there are far more users to be supported than the number 
of transmit antennas available at BS, which violates the dimensionality constraint of BD [5]. Second, 
interference non-orthogonality among users always exists [6]. In other words, instantaneous channels 
among users are non-orthogonal to one another, which result in mutual inter-user interference. 

There are generally two common types of scheduling algorithms, namely user selection and user 
grouping. For user selection algorithms whose objective is to select a subset of users for scheduling, it is 
natural to find an optimal subset by using exhaustive search but it is very computationally demanding even 
for moderate number of users. In this context, a large number of sub-optimal yet simplified algorithms 
has been proposed whose fundamental idea is to maximize system performance according to various user 
selection criteria [6-9]. In [6], Yoo and Goldsmith construct a subset of semi-orthogonal users with a 
heuristic user selection criterion, which is shown to achieve an optimal capacity scaling when the number 
of users is asymptotically large. Dimic et al. [7], propose a sub-optimal greedy algorithm to maximize 
the sum rate capacity by selecting a user who has the maximum projected norm on the null space of 
the existing users in a user subset. For users with multiple receive antennas, eigenmode and/or antenna 
selection is associated with user selection in order to achieve the best performance in terms of sum rate 
capacity [8,9]. On the other hand, user grouping algorithms take fairness among users into account and 
schedule all users to be served over consecutive scheduling units, e.g., timeslots if Time Division Multiple 
Access (TDMA) is employed. In particular, all users are divided into a number of groups by certain 
criteria [10,11], whose aim is to maximize system performance while minimizing spatial correlation 
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among users per group. Alternatively, one can consider applying proportional fair scheduling (PFS) [12] 
into user selection algorithms for capturing fairness among users while maintaining multiuser diversity 
gain [9]. Though its performance is superior to the user grouping algorithms because of the multiuser 
diversity gain, PFS provides neither fairness nor delay guarantee [6]. 

For the above-mentioned user scheduling algorithms, an effective performance metric is required in 
selecting either an optimal subset of users or an optimal scheduling arrangement. There is a large body 
of literature focused on uncorrelated downlink MU-MIMO systems with homogeneous users (see [13- 
16] and the references cited therein). For systems with heterogeneous users, the task of designing an 
efficient user scheduling metric becomes more challenging because there are more system parameters 
(for example, different number of receive antennas and different received signal-to-noise ratios (SNRs)) 
to be considered. Though there are some recent works that consider scheduling strategies for users with 
different received SNRs [17] and those with different number of receive antennas [18], there is a lack of 
works addressing the combined problem based on the scheduling criteria. Naturally, an interesting question 
arouses in mind is, whether those heuristical scheduling criteria/metrics employed in homogeneous MIMO 
broadcast channels are still applicable in heterogeneous environment. In this paper, we try to answer this 
question by studying users' channel characteristics in a subspace approach and designing effective user 
scheduling metrics from a geometric point of view. The main contributions of this paper are summarized 
as follows. 

• In Section III, we take into account the characteristics of principal angles between channels of het- 
erogeneous users and propose three angular-based scheduling criteria that show superior performance 
than the existing ones in terms of sum rate capacity. These proposed criteria include geometrical 
angle (i.e., product of principal angles), grouping-oriented criterion that is derived by using an upper 
and a lower bounds of sum rate capacity with BD, and selection-oriented criterion that is derived by 
approximating the change in sum rate capacity due to the enrollment of a new user into an existing 
user subset. 

• In Section IV, we propose two hybrid user scheduling algorithms that takes into account some key 
features of user grouping and selection algorithms, i.e., to capture fairness among users and to 
maximize the system performance in a greedy manner. When compared with the conventional user 
grouping algorithms [10,11] that require an exhaustive search for all possible grouping arrange- 
ments, no brute-force search is required for our algorithms and their computational complexities 
are only linearly, rather than exponentially, proportional to the total number of users in the system. 
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Further, simulation results in Section V reveal the effectiveness of these algorithms in heterogeneous 

environment despite a reduction in the user's search space. 
Notation: Matrices and vectors are represented as uppercase and lowercase letters, respectively. Trans- 
pose and conjugate transpose of a matrix are denoted as (-) T and (-) H , respectively. Further, we re- 
serve diag{-} for an diagonal matrix, while det(-), rank(-), tr(-), A(-), Aj(-), and || • \\p represent 
the determinant, rank, trace, diagonal part, the i-th singular value, and Frobenius norm of a matrix. In 
addition, we denote n as an intersection of two subspaces, (■) ± as the null space of a subspace, and |-| 
as the cardinality of a set. Lastly, E{-}, [-\ and abs(-) denote the statistical expectation, floor operation, 
and absolute value operation, respectively. 

II. System Model 

Consider a downlink MU-MIMO system with Mt transmit antennas at BS and K heterogeneous 
users 1 that are equipped with Mn k receive antennas at the fc-th user as illustrated in Fig. 1. We consider 
overloaded scenarios (i.e., My <C Ylk=i MrJ in which the BS cannot serve all the users simultaneously. 
For this reason, user scheduling is necessary to serve either a subset of users at one time or all users 
once over an entire scheduling period of T timeslots. 

Consider a subset of users T that has been scheduled for transmission. Denote as the transmit 
signal of user-A; in the group (i.e., k € T). Its receive signal is given by 

y k = H fc F fc x fe + ^2 H fc F;x; + n fe , (1) 

where G C Mr >> xMt is the channel matrix between the BS and user-A; and it can be further expressed 
as Hfc = v /pfcHfc with pk being the average received power of the k-th user and is an arbitrary matrix 
that depends on the channel model employed. Further, is a precoding matrix of the k-th user. For 
practical consideration, we adopt BD [5] as the linear precoder of the system, is the additive white 
Gaussian noise (AWGN) at the receive antennas of user-A; and is assumed to be zero-mean independent 
and identically distributed (i.i.d.) Gaussian vector with variance a\, i.e., ~ CM(0, a^)- 

Assume all channels are time-invariant during the scheduling period and the BS has perfect channel 
state information for all users. In order to track the influence of received SNR, we follow [17] and 
consider it as a function of the distance between the BS and a user, path loss exponent, and average 
transmit power per antenna. 

'Unlike homogeneous counterpart, heterogeneous users are of different antenna configurations and/or experience different 
channel environments. 
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III. User Scheduling Criteria for Heterogeneous Users 

In this section, we first briefly go over the existing user scheduling criteria for downlink correlated 
MU-MIMO systems with homogeneous users and then present our proposed scheduling criteria for 
heterogeneous users. 

A. Review on Metric Choice for Homogeneous Users 

Recall in Section II that we consider an overloaded downlink MU-MIMO system in which there are 
a large number of users K and the total number of receive antennas is far greater than that of the total 
number of transmit antennas at the BS, i.e., Ylk=i Mn k S> My. Since the number of users that the 
system can support is limited by the dimensionality constraint of BD [5] and the instantaneous channel 
matrices among users are generally non-orthogonal, user scheduling is necessary. 

Considering channel matrices as the subspace spanned by their column vectors, mutual interference 
across users can be represented as the correlation of the corresponding subspaces. Some recent works 
have been attempted to measure the level of subspace correlation in either angular or subspace domains 
and utilize them as user scheduling metrics. In the following, we will review three common ones, namely 
the largest principal angle, subspace collinearity, and chordal distance. 

1 ) Largest Principal Angle: In order to facilitate our subsequent discussion, we first review the 
definition of principal angle, or equivalently, canonical angle [19,20]. 

Definition 1 (Principal Angle [21,22]). For any two nonzero subspaces, namely, U k ,Vj C C n with p = 
min {dim(Uk), dim(Vj)}, the principal angles between U k ,i = U k and Vj t i = Vj are recursively defined 
to be the numbers < 6k,j,i < vr/2 such that 

cos9 k j,i = max {uji6Mjiil)Vie v iil ,||u lb || !1 =||v,||2=i}yf u * ( 2 ) 
= y^Ufc.i, i = 1,2, - •• ,p, (3) 

where u k j and v^j are the vectors that construct the i-th principal angle 9 k ,j,i> || u fc,j||2 = || v j,i||2 = 1» 
U k ,i = Uk,i-i n u^_ x and V j:i = Vjj-i n 2 - Further, 

0k,j,min = @k,j,l — @k,j,2 ^ " " " ^ @k,j,p — dk,j,maxi (4) 

and 9 k ,j t p = k ,j, max when dim(U k ) = dim(Vj). □ 

2 From a space's viewpoint, Vj,»-i n represents the intersection of Vjj-i and the null space of Vj,;_i, which means 

that the representative subspace of Vj ; i-i is removed from the subspace Vj,»-i. 
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From the definition, we can find that the cosine of the principal angle is the inner product of two 
vectors of interest, and the minimal principal angle represents the largest inner product of any vectors in 
the two subspaces. If there exists intersection between two subspaces, the minimal principal angle would 
be zero. 

In [11], Wang and Murch proposed a user scheduling algorithm that aims at minimizing spatial 
correlation across users by arranging users with no or low inter-user spatial correlation into one group. 
In order to determine the impact of spatial correlation of the k-th user due to the remaining users, they 
utilize the largest principal angle between the orthogonal basis of the row spaces of user-fc's transmission 

T 



channel and its interference channel 



The resulting largest 



n l ' ' ' ' ' n fc-l> rl fc+l' " ' n K 

principal angle of each user is then used as a scheduling metric for optimal group arrangement. In 
Section IV, we will present this algorithm in detail and discuss why it motivates us for proposing two 
reduced-complexity greedy-based hybrid user scheduling algorithms. 

2 ) Subspace Collinearity : Subspace collinearity is a criterion that reflects the similarity of two matrix 
subspaces and it can be used for characterizing users' spatial separability [23,24]. In general, given two 
matrices and Mg, their collinearity can be represented as [25] 

abs (tr(M A M%)) 

where < coI(Ma,Mb) < 1, and co/(Ma,M#) = when and Mb are orthogonal to each other, 
while coI(Ma, Mb) = 1 when the matrices are identical. It is clear that the smaller the collinearity is, 
the less similarity, or equivalently the larger distance, of the two matrix subspaces. 

3) Chordal Distance: Chordal distance is commonly used in limited feedback systems [26,27] for 
codebook design but it has also been recently considered as a user scheduling criterion [28]. As referred 
to [29], the chordal distance between two subspaces Uk and Vj with dimensions p and q is defined in 
terms of the user's principal angles as follows 



d c (U k ,Vj) = ^|l|Pz4 -Pv> = ^ 



min{p,(j} 

^2 sm2 Ok,j,i, 



i=i 

where P^ fc and Py. are the projection matrices of Uk and Vj, respectively. Note that we only use the 
lower dimension, i.e., min{p, q}, of the principal angles for the definition. 
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B. Proposed User Scheduling Criteria 

In downlink MU-MIMO channel with heterogeneous users, principal angles between two subspaces 
of dimensions 3 p and q possess the following specific characteristics, namely 

TT 

= @k,j,l = ■ ■ ■ = Okj, m < 9k,j,m+l < ■ ' ' < $fc,j,n < 9k,j,n+l = ■ ■ ■ = 9k,j,p = 77> (6) 

N v ' N v ' N v ' 1 

Part I Part II Part III 

where 9k,j,i and 9k,j,p are the minimum and maximum principal angles, respectively. These three parts 
represent different physical meanings on subspace correlation (that models the mutual interference among 
users) and they are summarized as follows. For Part I that consists of m zero principal angles, they 
represent m overlapped and fully-correlated basis of the two subspaces. As for the second part that is 
composed of n — m principal angles whose values lie between and it/2, the corresponding basis of the 
two subspaces are non-overlapped but non-orthogonal with one another, which result in partial subspace 
correlation. Regarding the third part that contains q — n principal angles of tt/2, it means there are 
q — n orthogonal principal angles. 

In order to understand the characteristics of (6), we consider an example in which there are two 
heterogeneous users k and j with p = Mn k and q = Mr. receive antennas, respectively, and their 
instantaneous channel matrices are non-orthogonal to one another because of, e.g., their close proximity. 
Denote Uk and Vj as the two subspaces of dimensions p and q that are spanned by the columns of 
their channel matrices. Assuming that the overlapped dimension of the channel subspaces is m, we 
have dim(Uk n Vj) = m and hence m zero principal angles (c.f., Part I in (6)). For the non-overlapped 
counterpart of dimension p — m, there exists p — n mutually orthogonal components that correspond to 
p — n largest principal angle of tt/2 (c.f., Part III), while the remaining n — m ones are non-orthogonal 
with one another and they represent those principal angles with values between and tt/2 (c.f., Part II). 

As referred to the characteristics above, it is clear that subspace correlation is reduced if the prin- 
cipal angles are as large as possible. For user scheduling algorithms that aim at minimizing subspace 
correlation and hence interference non-orthogonality among users, it means that the corresponding user 
selection/grouping criteria should be designed in such a way that users with a smaller dimension of Part I 
and a larger dimension of Part III can be served simultaneously. This important observation leads us to 
understand that single-dimensional information, e.g., the largest/smallest principal angles, is sometimes not 
enough to characterize the correlation between two subspaces. For example, if and Vj are subspaces of 
unequal dimensions that have a nontrivial intersection, then 9k,j, m i n = and 9kj, ma x = n/2, but neither 

'Without loss of generality, we assume p < q. 
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of them might convey the desired information on whether these two subspaces are highly correlated or 
not. Similarly, though subspace collinearity reflects the similarity of two subspaces to some extent, it is 
an indirect measure because of its heuristic reflection of the orthogonality of the channel matrices. As 
for chordal distance, it is similar to the subspace collinearity that requires the two compared subspaces 
Uk and Vk to be of the same dimension. If the subspaces are of different dimensions, the lower dimension 
is usually adopted, which may result in an inaccurate measure for systems with heterogeneous users. 

In view of the disadvantages of these three metrics that do not accurately reflect users' spatial sepa- 
rability, it would be interesting to consider not only the largest and smallest principal angles, but also 
those "intermediate angles". In the following, we propose three other user scheduling criteria that take 
into account all principal angles as given in (6). 

1) Geometrical Angle: Let Uk = span-fu^i, 11^2, • • • , Ufc iP } and Vj = span{\j^,\j^-, • • • , v j,<?} be 
two subspaces with 1 < p < q. Geometrical angle, i.e., the angle ipkj = £(Uk,Vj) between the two 
subspaces, is defined as [30,31] 

p 

cos 2 V>fcj = Y[cos 2 9 kJji , (7) 

i=i 

where u^, v^j and 6k,j,i are defined in Definition 1. Given two users k and j with channel matrices 

and EL, , respectively, without loss of generality, we assume p = rank(Hk) < rank(Hj) = q. Geometrical 

angle can be alternatively defined as [31] 

2 , det(M fcj M^.) 

cos tph j = Hz—, (8) 

^ det(H fe Hf ) 

where j = H^H^ is a cross-correlation matrix that is represented in inner product form. 



In general, the value of cos 2 ipkj represents the ratio between the volume of the parallelepiped spanned 



. ^-f „/,. . 

hi 

by the projection of the basis vectors of the lower dimension subspace on the higher dimension subspace 
and the volume of the parallelepiped spanned by the basis vectors of the lower dimension subspace. 
Simply put, geometrical angle reflects the relationship between the projection of the smaller dimensional 
subspace onto the larger dimensional subspace and itself [31]. Fig. 2(a) shows a two-dimensional example. 
By definition, geometrical angle refers to the ratio of the area of the projection of plane-A onto plane-B 
to that of plane-A itself. It is obvious that the ratio cos 2 ipk,j is larger (or equivalently, the angle ip^j i s 
smaller) when the planes are closer, and vice versa. 

In our case, the level of subspace correlation between two users is characterized by the degree of 
overlapping between the corresponding channel subspaces. In particular, if the correlation is severe, the 
corresponding channel subspaces get closer to each other and hence the geometrical angle is smaller. In 
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other words, there are more principal angles of zeros and less principal angles of ir/2 in (6). Aiming at 
minimizing subspace correlation and hence interference non-orthogonality among the scheduled users, 
an effective user scheduling criterion should select for users with larger geometrical angles ip k j or 
equivalently, a smaller value of cos 2 -0jfc,j, i.e., 

M(H k ,Hj) = arg min cos 2 ^ fc (9) 
kjec 

where C is the candidate user pool. The metric M. (H^ , Hj ) in (9) is introduced to denote the correlation 
of two channel matrices H k and Hj, and is used as a scheduling criterion for user grouping. 

2) Grouping-Oriented Criterion: Recall from our discussion in (6) that users with a smaller dimension 
of correlated basis (i.e., Part I) and a larger dimension of orthogonal basis (i.e., Part III) are preferred to 
be served together. Therefore, it is important for a user grouping algorithm to take into account subspace 
correlation among users so as to minimize interference orthogonality on each group while maximizing 
the sum rate capacity. In the following, we re-express the sum rate capacity with BD in terms of principal 
angles and derive a scheduling metric that satisfies these two objectives. 

Theorem 1 (Sum rate capacity bounds of BD). For a K -heterogeneous user downlink MU-MIMO system, 
the sum rate capacity with BD is bounded as 



K I M Rk 

E l0g 2 1 + Z5 X lmin E Sin2 9 k,k,i 
k=l \ n i=l 



^ Csum 



K I M Rk 

k=l \ flt " i=l 



(10) 

where 6 k r . is the i-th principal angle between the range space of the k-th user's channel Hfc and its 
interference channel Moreover, \k,rnin cind \k,max are the minimum and maximum singular values 
ofH k . □ 

Proof 1. Please refer to Appendix A for details. 

It can be clearly observed from Theorem 1 that the sum rate capacity of the k-th user is increased 
monotonically with its J2i=i° sm2 ®k k i anc ^ tne P er f° rmance metric can be written as 

M(H. k ,H k ) = argmax ^ sin 2 (9^. (11) 
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This metric reflects the correlation of channel matrices between user-fc and the other users that intend to 
group together, and it is used as a scheduling criterion for user grouping. In order to take into account 
those zero principal angles corresponding to the overlapped subspaces (c.f., Part I in (6)), however, we 
prefer to use the cosine function because the sine of zero principal angle is equal to zero. Hence, our 
grouping-oriented scheduling criterion is embodied as 

M(H k ,U k ) = argmin^cos 2 ^, (12) 

i=i 

which reflects the similarity of the channels between a user and the other users in the group. 

Although the above-mentioned bound is not tight enough 4 , it is sufficient to determine the relationship 
between one user and the other group members as will be shown by simulations in Section V. 

3) Selection-Oriented Criterion: While the objective of a user grouping algorithm is to serve all users 
once over an entire scheduling period, user selection algorithm serves only a subset of users at one time. 
For a selected user subset, an addition of a new user would induce a change in sum rate capacity AC 
that can be separated into two components [33], namely the incremental gain in sum rate capacity C ga i n 
and the incremental capacity degradation Cy oss due to the interference of this new user on the existing 
users. It is clear that if the gain surpasses the loss, the incoming user would exert an positive influence 
on the sum rate capacity, and vice versa. In view of this, one promising user selection criterion is to 
evaluate the impact of each user from the candidate user pool on the change in sum rate capacity of the 
user subset, followed by enrolling a user into the subset if it brings the largest and positive value of AC. 
In the following, we derive a user selection criterion by quantifying the capacity change in terms of the 
geometrical angles. 

Theorem 2 (Change in sum rate capacity due to a new-user addition [34]). For a K -heterogeneous user 
downlink MU-MIMO system, the change in sum rate capacity when a new user is added in a selected 
user subset T is 

AC = Cg a i n — Ci oss , (13) 

where C ga i n quantifies the gain in sum rate capacity due to a new incoming user (user k) and it is 

4 In [32], Shen et al. derived a lower bound on the ergodic sum capacity with BD in terms of unordered eigenvalues of complex 
Wishart matrices. While their objective is to derive a tight semi-closed-form expression and perform an analytical comparison 
between BD and DPC, our derivation aims at investigating into the relationship between the sum rate capacity and the principal 
angles of users. 
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approximated as 

Cgain ~ log 2 ^det(H fc Hf)sin 2 ^ , (14) 

with ipk,s = -^(Hfe, H s ) being the geometrical angle between the range spaces of the channel matrix of the 
new user (Hfc) and the aggregated channel matrix of the existing users in the selected user subset (HJ, 
while Ci oss denotes the loss in sum rate capacity resulting from the interference of this new user to 
existing users and it is given by 

Class ~ lo §2 ( % det (HjHf ) sin 2 sin 2 ^ s \j J , (15) 
jeT V£J ™ J 

with ipj yS \j — ^(Hj, H s \Hj) ^e/wg f/ie geometrical angle between the range spaces of the channel matrix 

of the j-th user in the subset (ilj) and the aggregated channel matrix of the other selected users H s \Hj, 

and similar definition holds for ipk,s\j = ^(Ht, H s \Hj). □ 

Proof 2. Please refer to Appendix B for details. 

Regarding the loss in sum rate capacity Ci oss , it can be geometrically interpreted by the following 
example. Suppose there are two users (user-1 and user-2) in an existing user subset and a new incoming 
user (user-3) whose effective channel lies in the intersection of the null spaces of Hi and H2. As referred 
to (15), the capacity loss due to user-3 is mainly due to two components: one is the projection of user-3's 
channel onto the null space of Hi followed by the range space of H2, and the other one is the projection 
of user-3 's channel onto the null space of H2 and then the range space of Hi. As shown in Fig. 2(b), 
the first component is equivalent to recursively project OA onto OB and then OC. Similarly, the second 
component is geometrically equivalent to a recursive projection of OA onto OE followed by OD. 

From Theorem 2, it is clear that the corresponding performance metric is to maximize the incremental 
improvement in (13), i.e., 

hA( „ ^ {|rhl) Pfc det(H fc Hf)sin 2 V M 
M(H k ,U s ) = argmax /- - \ ' (16) 

keC UjerPj det ( H J H f ) sin2 <Pj,s\3 sin2 ^k,s\3 
which measures the influence introduced by the incoming user (denoted by Hfc) on the already selected 
users (denoted by H s the concatenated channel matrices), and it is used as a scheduling criterion for 
user selection or hybrid user scheduling. 
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Alternatively, a simplified user selection criterion is to solely consider C ga i n , namely 

M(H k ,H s ) = argmax/jfcdet (H fc Hf ) sin 2 ^ fciS , (17) 

= argmaxp fe (det(H fc H^) - det (H fc H?H 8 fi£)) (18) 

kac 

= argma^p fe det(H fe H^H^Hf), (19) 

where (18) is due to the alternative definition of geometrical angle as given in (8). Geometrically, this 
simplified criterion (19) refers to the volume of the parallelepiped spanned by the projection of the basis 
vectors of the range space of onto the null space of H s , i.e., the aggregated channel matrix of the 
scheduled users. Moreover, when compared (19) with an alternative expression of geometrical angle to (9) 
in terms of sine function 5 , their difference lies on the volume of channel matrix, i.e., p k det (H^H^). 
Nevertheless, the impact of the channel volume on vanishes when the number of users is asymptotically 
large because a best user with p k det (HfcHj^) — > 1 can always be found. 

To sum up, our proposed criteria can be applied in various user scheduling algorithms. In particular, the 
geometrical angle (c.f., (9)) and grouping-oriented criterion (c.f., (12)) are more suitable for user grouping 
algorithms because both criteria emphasize on the integrated effect of all involved users. On the other 
hand, the selection-oriented criteria (c.f., (16) and (17)) focus more on the impact of a newly-recruited 
user on the sum rate capacity and therefore, they are more appropriate for user selection algorithms or 
our proposed hybrid user scheduling algorithms that will be presented in the next section. 

IV. Proposed Scheduling Algorithms for Heterogeneous Users 

In this section, we first review a conventional user grouping algorithm that is dedicated for scheduling 
spatially correlated homogeneous users, followed by presenting two reduced-complexity greedy-based 
user grouping algorithms. 

A. Conventional User Grouping Algorithm [11] 

Consider a downlink MU-MIMO system with K = 2L homogeneous users. Assume that the channel 
remains unchanged during the entire scheduling period of L timeslots such that these K users are divided 
into L groups of size G = 2. The objective of [11] is to design a user scheduling algorithm so as 
to minimize spatial correlation between two users per group (or equivalently, in the same timeslot) 

5 In (9), the user selection criterion is to find a user with the minimum value of the product of cosine square. It is equivalent 
to search for a user that maximizes the product of sine square, i.e., M(Hk, H,) = argmax fejg e sin 2 ipk,j- 
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while maximizing multiuser diversity. In this context, Wang and Murch [11] have considered using the 
largest principal angle as the user grouping criteria. In particular, given C\ l C 2L_2 ■ ■ ■ Cf / 'L\ possible 
arrangements 6 , a max-min operation is performed in which the smallest largest principal angle for each 
arrangement is first identified, followed by selecting the arrangement with the largest value among all 
these Cf L C 2L_2 • • • C\jL\ smallest angles as the best one. However, there are two main drawbacks in 
applying this algorithm to systems with heterogeneous users. 

1) Reduced Average Sum Rate Capacity. Due to the fact that homogeneous users are equipped with the 
same number of receive antennas, the group size can be heuristically set as a constant G = Mt/Mr, 
where Mf> k = Mr for all k. For heterogeneous users, however, it is not wise to determine the 
group size in advance because each user may have different number of receive antennas. For 
example, if we set G = [Mr/ max J, it is apparent that either the total dimension/degree 
of freedom per group cannot be fully utilized or a larger number of groups is required 7 , which 
results in a lower average sum rate capacity per group. On the other hand, if we set the group size 
according to the minimum number of receive antennas, the total number of receive antennas in a 
group will definitely exceed the number of transmit antennas, which results in the violation of the 
dimensionality constraint of BD. 

2) Huge Computational Complexity. Roughly speaking, the algorithm involves as many as 
C| L C 2L ~ 2 • • • C\jL\ possible arrangements. Since there are L groups for each arrangement, more 
than 0(L 2 ) comparisons 8 are required per arrangement. 

Because of these concerns, we have developed two hybrid user scheduling algorithms that takes into 
account some key features of user grouping and selection algorithms, i.e., to capture fairness among 
users and to maximize the system performance in a greedy manner. These two algorithms, which aim at 
minimizing group size and maximizing degree of freedom, are outlined in Tables I and II, respectively, 
and they are summarized as follows. 

6 Each arrangement consists of L groups, each of which has G = 2 users. 

7 Take a {1,1,1,2,3,4} x 6 MU-MIMO system as an example. Here, M T = 6, maxM fli = 4, and minM^ = 1. If 
the group size G is determined by [Mt / max Mn k J , then there exists only G = L6/4J = 1 user per group. In this case, 
there are 3 groups that separately consist of 1 user with single receive antenna, and the remaining degree of freedom (i.e., 
Mt — min Mjj fc = 5 dimensions) cannot be fully utilized. Further, since it requires Nq = 6 groups (or 6 timeslots) in serving 
all users, it is equivalent to the conventional TDMA scheme in which only one user is served by the BS at a time. Due to the 
lack of spatial multiplexing among users, the sum rate capacity per group is significantly reduced. 

8 Here, f(x) = 0(g(x)) is used to represent f(x)/g(x) = a when x — > oo, where a is a constant irrelative to x. 
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B. Algorithm 1: Group Number Minimization 

In contrast to the conventional user grouping algorithm that considers a constant group size, we 
alternatively consider variable group size and minimize the number of groups Nq required by setting 



N G = 



(20) 



M T 

Each group is allowed to have different number of group members as long as its total dimension (i.e., the 
total number of receive antennas) is smaller than or equal to Mt, i.e., the total degree of freedom available 
for interference-free transmission with BD. In this case, we can ensure that the algorithm provides the 
same fairness as the conventional algorithm but requires a fewer number of groups 9 . 

For each group T (s) , where g = 1,2,- •• ,N G , users are selected in such a way that better users 
have higher priority in getting the resources. In particular, we select the best Nq users with the largest 
Frobenius norm from the candidate user pool C = {1, 2, ■ ■ ■ , K} and assign them to be the first user of 
each group, i.e., 

u[ 9) = arg max ueC 1 1 H u 1 1 
T^ = {u{ 9) }, C = C\{u[ g) } 
For the remaining K — Nq users, they will be assigned to one of the Nq groups by certain criterion that 
aims at minimizing subspace correlation and interference non-orthogonality per group while maximizing 
the sum rate capacity. Taking into account the performance-and-complexity tradeoff, we consider the 
simplified selection-oriented criterion as given in (17). It is important mentioning that the idea of our 
approach is inspired by the idea of greedy selection but there are two main differences: 

1) While typical user selection algorithms aims at choosing the "best" user for a group/user subset, 
our algorithm alternatively help users select the best group. 

2) No user is allowed to be assigned into more than one group. In other words, each user is served 
by the BS only once within an entire scheduling period of Nq timeslots. 

The selection procedure is summarized as follows. Firstly, we identify a user (say, user-A;) with the largest 
Frobenius norm from the updated candidate user pool. Then, the simplified selection-oriented criterion is 
executed by selecting a group that has the largest incremental gain in sum rate capacity due to the 
enrollment of this user while the dimensionality constraint of BD is satisfied. This procedure can also 



9 We take a {1, 1, 1, 2, 3, 4} x 6 MU-MIMO system as discussed in footnote 7 as our example again. By using this proposed 
algorithm, the total number of groups Nq is significantly reduced from 6 to 2. 
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be mathematically expressed as 

u k = argmax feeC ||H fc || 

- - ci y^^> 

g s = argmaxi< 9 < G/ 9 A; det(H fc H^)sm 2 Z(Hr J ,H fe ), C = C\{u k }, 

such that rank(H.i 9 ^) + rank(Hk) < Mt- If the constraint cannot be satisfied, this user will be assigned 
to the next best group. As an important remark, once the user is selected, there is a user shedding step [6, 
34] in which it will be removed from the candidate user pool (i.e., C = C\{uk}) such that they are not 
being further considered in the remaining iterations. 

C. Algorithm 2: Degree-of-Freedom Maximization 

The idea of this algorithm is to fully utilize the total dimension/degrees of freedom for every group 
which, according to the dimensionality constraint of BD, is the number of transmit antennas at the 
BS, Mt- The algorithm is initialized by setting the group size to the number of transmit antennas, 
i.e., G = Mt- User selection is started at the first group by choosing the first user out of the K 
total users in the candidate user pool C with the maximum Frobenius norm, i.e., 

ui = argmax ueC ||H U || 
T« = {u 1 }, C = C\{ Ul } 
Like the previous algorithm, a user shedding step is performed such that the selected user will no longer 
be considered again in next iterations. 

Then, the next best users uj, for are chosen from the updated candidate user pool according to 
the simplified selection-oriented criterion given in (17) and it is mathematically written as 

u k = arg maxfeec Pk det(H fe H^) sin 2 Z(H S , H fc ) 

r (1) =r (1) uH c = c\{ Uk } 

This selection process for is terminated when the sum of channel ranks of the existing users and the 
new incoming user is larger than the remaining degree of freedom, i.e., rank(H s ) + rank(Hk) > Mt- 
Finally, the whole scheduling procedure repeats for the second group and so on until all of the 
K users have been assigned. 



D. Advantages 

Compared with the conventional user grouping algorithm [11], our proposed hybrid user scheduling 
algorithms have the following advantages. 
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1) Higher Average Sum Rate Capacity. The proposed algorithms provide an efficient mechanism in 
minimizing the number of groups required and utilize greedy-based criteria in enrolling as many 
users in one group as possible. In this case, the total dimension/degree of freedom per group is 
better exploited and the timeslots required for providing fairness for all users can be largely reduced, 
which results in a higher average sum rate capacity per group than [11]. 

2) Lower Computational Complexity. While the conventional user grouping algorithm requires about 
L 2 C 2 2 L C 2 2 L - 2 ■ ■ ■ CljU comparisons in finding an optimal grouping arrangement, these numbers 
are significantly required to approximately Nc{2L — Nq) and L(2L — 1) for Algorithms-1 and 2, 
respectively. 

V. Numerical Results 

Monte Carlo simulations are provided to evaluate the effectiveness of our three proposed user scheduling 
criteria (namely geometrical angle, grouping-oriented criterion and selection-oriented criterion) and the 
two proposed hybrid user scheduling algorithms in terms of the 10% outage capacity [35], which is 
defined as the rate that the channel can support with 90% probability. 

For the sake of simplicity, we consider a general Kronecker Product Form (KPF) channel model [36] 
for the simulation, i.e., 

Hfc = Y^R^^Hfc^R^ 2 (25) 

where pk = jj^ja is the received power of user-fc with Pt, dk and a being the transmit power, the distance 
between BS and k-th user, and path loss exponent, respectively. In addition, Hk,w £ C Mr <= xMt is a zero- 
mean unit-variance i.i.d. complex Gaussian matrix between the BS and the k-th user, R r fc G C Mr >> xMr >> 
and R t) £ G C MtXMt are the receive and transmit correlation matrices of user-fc, which can be modeled 
as [R r ,fc]ij = 7rfc J ' an d [R-t,fc]ij = T tk^ ' respectively, with correlation coefficients ~f r ,k,Tt,k 10 • 
Unless stated otherwise, the simulation configurations of some key parameters are listed as follows. 

• We employ BD as the linear precoding algorithm. Moreover, water-filling policy is considered for 
our numerical simulations even though the two theorems are derived by following an equal power 
allocation policy. 

• The number of receive antennas of the k-th user, M^ k , is randomly chosen from {1, 2, • • • , N} with 
equal probability, where N is the largest number of receive antennas in the system. 

10 This correlation model follows the Toeplitz structure in [37-39] and is valid for application scenarios where antenna elements 
at the receiver side are with equidistant spacings [37]. 
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• As for the received power pk, the path loss exponent a is set to 3 and the distance dk is randomly 
generated with range [200m, 1000m]. Then, pk is normalized by the maximum possible received 
power when = 200m. 

• The receive and transmit correlation coefficients 7^ and Tt : k are modeled as uniformly distributed 
variables with range [0,1]. 

As the first example, we evaluate the effectiveness of the first two proposed criteria, namely geometrical 
angle (9) and grouping-oriented criterion (12), by comparing their performance with the largest principal 
angle, subspace collinearity and chordal distance. Two baseline criteria, namely exhaustive search and 
random selection, are also considered. Fig. 3 shows the 10% outage capacity of these user grouping criteria 
for a {1, 1, 1, 2, 3, 4} x 6 MU-MIMO system. It can be observed that our proposed criteria perform well 
and both of them outperform the largest principal angle. Further, the grouping-oriented criterion achieves 
the average sum rate capacity per group of the exhaustive search method at high SNR while requiring 
less computational complexities. In Fig. 4, we take a closer look at the difference in sum rate capacity 
between the largest principal angle and our proposed grouping-oriented criterion by considering different 
user configurations. In general, our proposed criterion performs as good as the largest principal angle in 
homogeneous environment (c.f., {2,2,2,2,2,2} x 12 configuration) due to the fact that the latter does 
sufficiently reflect the spatial separability across homogeneous users. Nevertheless, the advantage of our 
proposed criterion becomes much more evident when the users are equipped with different number of 
receive antennas. In such a heterogeneous setting, it can be observed that our proposed criterion shows an 
improvement of 2 bps/Hz and 10 bps/Hz for {1, 1, 1, 2, 3, 4} x 12 and {1, 2, 3, 4, 5, 6} x 12 configurations, 
respectively, at a SNR of 40 dB. These results verify that our proposed criterion is a more appropriate 
performance metric in scheduling heterogeneous users in downlink MU-MIMO systems. 

To illustrate the effectiveness of our third proposed metric, i.e., selection-oriented criterion, we apply 
it into a greedy user selection algorithm whose details are presented as follows. Let C = {1, 2, ■ ■ ■ , K} 
and T = be, respectively, the set of the candidate user pool (in which the initial state consists of all K 
users) and the scheduled user pool (i.e., a subset of selected users that will be served by the BS). The 
algorithm is initialized by specifying a maximum degree of freedom D available for interference-free 
transmission. Due to the dimensionality constraint of BD, it is usually an integer no larger than the total 
number of transmit antennas 11 , i.e., D = Mr- User selection is started by choosing the first user with the 

"if all users are equipped with single antenna, it is also equivalent to the number of users that can be supported at one 
scheduling timeslot. 
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maximum Frobenius norm, i.e., u\ = argmax Mg c ||H u ||f. The two user pools are updated accordingly 
as T = TU {u\} and C = C\{u±}. Then, the next best users Uk, where k G C, are chosen from the 
updated candidate user pool according to one of our proposed user selection criteria and we hereby 
take the simplified selection-oriented criterion given in (17) as an illustrative example. The selection 
process will be terminated if the sum of channel ranks of the existing users in the subset and the new 
incoming user is larger than the remaining degree of freedom, i.e., rank(H s ) +rank(Hk) > Mr- Fig- 5 
illustrates the effectiveness of our proposed selection-oriented criterion. We consider a heterogeneous 
MU-MIMO broadcast channel, where the BS has 12 transmit antennas and each of the 20 users equips 
with either 1 or 2 receive antennas. For comparison purpose, we have also considered (a) the greedy 
zero-forcing algorithm with BD precoding [7,9] and (b) applying the largest principal angle into the 
user selection algorithm. As referred to the figure, it is clear that the largest principal angle does not 
perform well because of its incapability in reflecting users' spatial separability accurately. On the other 
hand, our proposed simplified selection-oriented criterion performs better than the greedy zero forcing 
algorithm and it can be observed that the performance difference increases with SNR (for example, from 
less than 5 bps/Hz at 20 dB to more than 10 bps/Hz at 40 dB). In addition to the simplified selection 
criterion, we have also shown the performance of the original version of our proposed criterion (16) 
that considers both C ga / m and C/ oss . It is seen that the capacity improvement is even higher despite an 
increase in computational complexity but the performance of the simplified criterion (17) would approach 
the original one (16) when the number of users is asymptotically large. It is because when the number 
of users in the candidate user pool increases, we can always find a user with the largest gain in sum rate 
capacity and a relatively smaller capacity loss. 

Apart from comparing the performance of various user scheduling criteria from the SNR point of view, 
we also investigate into their performance in terms of the number of users available for scheduling. In 
order to demonstrate the effectiveness of our proposed selection-oriented criterion (16), we also consider 
an optimal user selection by exhaustive search. As referred to Fig. 6, it can be observed that at high SNR, 
the performance of our proposed selection-oriented criterion approaches that of the optimal user selection 
when the number of users is large enough (e.g., 30 users). There is also an interesting observation on 
geometrical angle. Namely, while its sum rate capacity is the lowest among all possible criteria when 
there are only a few users in the candidate user pool, its performance increases with the number of users 
and approaches that of our proposed selection-oriented criterion. This observation is consistent with our 
findings in Section HI.B that the impact of the channel volume (i.e., det (H^H^) in (16) and (17)) 
on the sum rate capacity vanishes when the number of users is asymptotically large. 
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Lastly, we demonstrate the effectiveness of our proposed hybrid user scheduling algorithms over the 
conventional one [11] in a {1, 1, 1,2,3,4} x 6 MU-MIMO configuration. As discussed in Section IV.A, 
the group size of [11] is pre-determined in advance and its average sum rate capacity per group is 
expected to be lower than that of our proposed algorithms. In view of these concerns, we apply an 
optimal user grouping strategy by exhaustive search in [11], while considering only the sub-optimal yet 
simplified selection-oriented criterion (17) for our two proposed algorithms. It can be observed from the 
numerical results in Fig. 7 that though the performance of our two proposed algorithms are slightly inferior 
than [11] at low SNR due to the asymptotic SNR approximation (38) for C ga i n , they perform better at 
high SNR while requiring significantly less computational complexities. Fig. 8 also shows a relatively 
fair comparison in which all of the three user scheduling algorithms employ the largest principal angle as 
the performance metric. As it can be seen, the Group-Number Minimized Algorithm 1 outperforms [11] 
for the entire SNR of interest while the Degree-of-Freedom Maximized Algorithm 2 shows its superior 
performance at high SNR. This performance difference is mainly due to the grouping arrangement in 
the first algorithm in which the best Nq users (in terms of the Frobenius norm) are distributed among 
all groups such that they are not competing with one another for resources. These users can then be 
allocated resources with higher priority and exert a higher influence on the sum rate capacity. Based on 
these results together with the performance-and-complexity tradeoff, it is clear that our proposed hybrid 
user scheduling algorithms are a promising candidate to be applied in heterogeneous environment. 

VI. Conclusion 

In this paper, we have investigated into the design of user scheduling metrics for downlink MU- 
MIMO systems with heterogeneous users. We study users' channel characteristics in a subspace approach 
by representing the mutual interference across users that are originated from the interference non- 
orthogonality as the inter-user subspace correlation, and find that those conventional subspace-based 
user scheduling criteria that are commonly used in homogeneous users do not accurately reflect users' 
spatial separability. In response, we design from a geometric point of view three effective user scheduling 
metrics that aim at maximizing sum rate capacity while minimizing interference non-orthogonality among 
users. We also propose two hybrid user scheduling algorithms that can capture fairness among users while 
maximizing sum rate capacity in a greedy manner. When compared with the conventional user scheduling 
algorithm, our proposed approaches have lower computational complexities and shown to achieve a higher 
average sum rate capacity. 
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Appendix 

A. Proof of Theorem 1 

For BD [5], the precoding matrix of the A;-th user is expressed as a product of two precoders F afc 
and Fb k , in which the former is used for suppressing multiuser interference and the latter is used for 
performance optimization (e.g., sum rate maximization), i.e., 

F fc = /3F afc F bfc =/3v[ 0) F 5fc , (26) 

where the columns of V"j^ act as basis vectors that span the null space of the interference channel H^, 
and j3 is chosen such that the total transmit power at the BS is less than the maximum transit power 
constraint Pp. Assuming equal power allocation, i.e., /3 2 Ffe fc F^ = I, the sum rate capacity of the £>th 
user can be written as follows. 

C k = log 2 det(l MT + ^HfH fe F fe F;^ (27) 
= log 2 det (V T + H fc F fc Ff) (28) 

= log 2 det (l MT + |v«E^i^vfvi^) , (29) 

where (27) is due to the zero-interference constraint of BD that ensures H^F^ = [5], (28) is due to 
the definition of (i.e., = v /7jprl; s ), and (29) is due to the assumption of /3 2 Ffe fc F^ = I and the 
eigenvalue decomposition of H^, with = dmgjA^i, A^j ■■■ , ^k,M Rk } being its singular matrix and 
the columns of being the basis vectors spanning its range space. 

Denote = vj^^vj^. The sum rate capacity can be expressed in terms of the eigenvalue matrix 
of T^Tf as follows. 

C k = log 2 det(l MT _ Zfc + ^A(T fe S 2 fe T^, (30) 

where is the rank of and A(-) represents the corresponding diagonal matrix. Though (30) is 
exact, it is not easy to obtain any insight on user scheduling criteria and therefore, we resort to develop 
an upper and a lower bounds of Ck by using the following propositions. 

Proposition 1 (Upper bound on the determinant of a matrix [40]). For any positive definite matrix Mp, 
the following relation holds 

det(Mc) < (-tr(M c )) , (31) 



where m is any positive integer. □ 
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Proposition 2 (Trace inequality for matrix product [41,42]). For any two Hermitian positive semi-definite 
matrices Mo and Mg, there holds 

n n 

J2 \i(M D )\ n _ i+1 (M E ) < tr(M D M E ) < £ Xi(M D )X t (M E ), (32) 

i=l i=l 

w/iere Aj(-) is the i-th singular value of the operated matrix. □ 
By using Proposition 1, (30) can be upper-bounded as 

log 2 det (l MT _ Lk + ^A(T fe £ 2 lf )) < M Rk log 2 (l + ^L^tr(T k ^M)) 

= M R JogJl + -/^tr(V 2 k T%T k )). (33) 

Denote \i(T k ) = sin^^ with kki being the i-th principal angle of the two subspaces vj^ and 
Vj^. The sum rate capacity (33) can further be upper-bounded as the following closed-form expression 



by using Proposition 2. 



M 



C k < M R Jo g2 [l + J ^Y X h s ^ k^ 



< M Rk log 2 I 1 + jg-fk^a, £ S m 2 6 kki \ , (34) 

where \ ktmax = Afc,i is the maximum eigenvalue of 

Similarly, the capacity lower bound can be developed by using Proposition 2 as follows. 



Ck = log 2 det (l MT _~ Lk + gA(T fe S 2 fe Tf)) 
> log 2 + ^tr(T fc S 2 T^ 



log 2 ( 1 + %r(£ 2 Tf T fc ; 



™ i=l 



> log 2 (l + 0A 2 imm Esm 2 ^], (35) 



1=1 



with \h,mix = ^k,M Rk being the minimum eigenvalue of H^. 
This completes the proof of Theorem 1 . 
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B. Proof of Theorem 2 

For a .ftT-heterogeneous user downlink MU-MIMO system, the sum rate capacity is updated by an 
amount AC when a new user is added in the selected user subset T [33], i.e., 

AC = Cgain Ci oss , 

where C ga i n refers to the gain in sum rate capacity due to this new user and it is given by 
Cgain = log 2 det ^I Mt + ^Hf H fe F fe F^ 

= log 2 det (l MT + |v fe E^Vfv(0)v(0)") 

= log 2 det (l MRk + |E fe ^vi^V(°) Vi°)*V«) , (36) 

with the precoder 12 = /^V^F^, while the columns of Vj^ and being the basis vectors that span, 
respectively, the range space of the incoming user-/c's channel and the null space of the aggregated 
channels of the existing users in the subset (i.e., H s ). 

Denote = vf'^V^. We can asymptotically approximate (36) with respect to received SNR as 

c gam = io g2 det(i Mflfc + ^s^if r fc ) 

« log 2 4det(E fc S^)n\ ? (r fc ) 
\ an i=i 

/ M « k \ 

= log 2 I ^ det (£ fe l£) J[ sin 2 fc>M 1 , (37) 

where Aj(rfc) = sin#fc jSi j with 0£ iS) j being the z-th principal angle of the two subspaces and V^. 
Following the definition of geometrical angle in Section III.B 13 , the approximated gain in sum rate 
capacity (37) is written as 

C gain ~ log 2 det sin 2 ^ = log 2 det (H fc fif ) sin 2 V M ) . (38) 

Though the sum rate capacity is increased due to the incoming user-k, its presence in the subset 
induces interference and hence performance loss with the existing users. Denote, respectively, the sum 

12 Recall in Appendix A that we consider the policy of equal power allocation for each precoder such that /3 2 F(, fc F^ = I. 
"Similarly, geometrical angle can be also represented by the sine function [43], i.e., sin 2 xpk,a = Yli=i sul2 0fc,s,»- 
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rate capacity before and after enrolling user-fc as C pre and C pos t, the loss in sum rate capacity Ci oss can 
be quantified in the following way. 



where 



Closs — Cpre Cp 0S t, 



and 



C pre = log 2 det ( 1m t H — 2 HjFjFj 

= ^log 2 det(l MT + 4 H f H ^ v r\^ v r\f) (39) 

C post = ^log 2 det(l Mr + ^HfH,v|^. )nfc V^ )nfc ), (40) 

with V^^ njt being the intersection of the null spaces of H s \Hj and In order to make V^^ nfc 
tractable, we apply alternating projection algorithm [44] into ^^\j)nk^n\j)nk sucn tnat tne intersection 
of two subspaces is approximated by the infinite power of the product of their projection matrices, namely, 

v (0) V (0)H _ / v (0) (0)H v (0) v (0)H\ K 

- (v^vWfvfvf ff v^vWf)-, (41 ) 

As referred to [45], Fuchs al. show by simulations that k = 3 is sufficient enough for their appli- 
cation scenarios of interest. Since our main focus is to investigate into the relationship of the capacity 
change from a geometrical viewpoint, rather than to come up with an exact closed-form expression, we 
consider k = 1 for the ease of our subsequent derivation. Then, C pos t in (40) can be approximated as 

C m „ « £log 2 de,(l„ T+ 'H» Hi V^Vy|»VrVrvft<"). («) 

with the columns of being the basis vector that span the null space of H s \Hj, i.e., the aggregated 
channels of the existing users except user-j. 
Given (39) and (42), we have 

C\oss = Cpre Cp OS t 

« log 2 det (l MT + -l H f H,V^ V^f ) 

- Elo & de t (l„ r + ^HfH,vW V ^vfvf"vWv(««y (43) 
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Since it is well known that the following relation 

log 2 det(I + M F + M G ) - log 2 det(I + M F ) = log 2 det(I + M G ) 

holds for any two matrices M.f and that are orthogonal to each other (i.e., M^M^ = 0), (43) is 
then simplified as 

c,„„ « Y.^(^ r + ^?^^>ry%y^)^ (44) 

Further denote Tj s = V^yV^ and Y& s = V^^Vjj, 1 ^. We can asymptotically approximate (44) with 
respect to received SNR in the following way. 

Cioss ~ E log 2 det ( Im H3 . + ■^ 2 -^jr^J"K*|iT*fcs"f E^"is J 

iog 2 det ^s J -SJrgr fc- r £t 3 



Mr, Af H , 



= E 4 det (^J) II Af (T js ) n A?(T fcs ; 
ier \ n i=i i=i 

= E log 2 4 det (SjSj) [] sin 2 ^- sV) < n sin 2 , (45) 

iGT V/™ i=l i=l / 

where Aj(Y JS ) = singly ^ with Oj^ s \j yi being the i-th principal angle of the two subspaces 
and V^, and similar definition holds for Aj(Yfc s ). 

Similar to the derivation of C ga i n , we follow the definition of geometrical angle and rewrite the 
approximated loss in sum rate capacity as follows. 

Qoss ~ E log 2 ( 4" det (^i^D sin2 ^j,s\j sin2 ^fc,s\j ) 
jeT V£J ™ ' 

= E log 2 ( 4 det (HjHf ) sin 2 iP jAj sin 2 ) . (46) 

jeT VCJn J 

With (38) and (46), the change in sum rate capacity due to the new incoming user AC is obtained 
and expressed in terms of geometrical angles. This completes the proof of Theorem 2. 
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Table I 

Hybrid User Scheduling Algorithm- 1: Group Number Minimization 



1: N G =[Y.LiM Rk /M T \ 

2: C = {1,2,--- ,K} 

3. J-(l) = J-(2) = . . . = T (N G ) = 

4: for g = 1 — )■ Nq do 

5: = argmax ueC ||H U || 

6: =T (9) U{ni 9) } 

7: C = C\{u[ g) } 

8: end for 

9: while |C| > do 

10: for g = 1 ->■ iV G do 

ll: Hi s) = {H,,iGr^} 

12: end for 

13: u k = argmaxfcec ||H fc || 

14: g s = arg maxi< 9 <Ar G p fc det(H fe Sf ) sin 2 Z(Hi ff) , H fc ) 

s.t. ran/c(Hi 9) ) + ran/c(H fc ) < M T 

15: if { 5s } / then 

i6: r (9s) = r (9s) u {u fc } 

17: else 

18: iVc <- N G + 1 

19: T (AT G ) = T (N G ) u 

20: end if 

21: C = C\K} 

22: end while 
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Table II 

Hybrid User Scheduling Algorithm-2: Degree-of-Fredom Maximization 



1: C = {1,2,--- ,K] 

2: T (1) = 

3: g = 1 

4: while |C| > do 

5: if T {9) == then 

6: ui = argmax ueC ||H„|| 

7: r^ = r^u{ni} 

8: C=C\{u 1 } 

9: else 

10: H S ={H M GT^} 

11: tifc = argmax ueC Pfcdet(H fc H^)sin 2 Z(H s ,H fe ) 

s.t. rank(H s ) + ranfc(H fc ) < M T 

12: if {u fe } / then 

13: T { 9) = T (9) U {u k } 

14: C = C\{u k } 

15: else 

16: 5^5+1 

17: end if 
18: end if 
19: end while 
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Fig. 2. Geometrical illustration of (a) geometrical angle; and (b) the loss in sum rate capacity due to a new incoming user. 
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Fig. 3. 10% outage capacity performance of various user grouping criteria. A {1,1,1,2,3,4} x 6 MU-MIMO system is 
considered. The proposed criterion refers to the "Grouping-Oriented Criterion" in Section III.B.2. 




Fig. 4. 10% outage capacity performance of three different MU-MIMO configurations, namely {1,2,3,4,4,4} x 12, 
{2,2,2,2,2,2} x 12, and {1,2,3,4,5,6} x 12. The two user grouping criteria considered are the largest principal angle 
and the proposed "Grouping-Oriented Criterion" in Section III.B.2. 
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Fig. 5. 10% outage capacity performance of various user selection criteria. The MU-MIMO system configuration consists of 
Mt = 12 transmit antennas at the base station and 20 users with either 1 or 2 receive antennas. The proposed criterion refers 
to the "Selection-Oriented Criterion" in Section III.B.3. 
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Fig. 6. The impact of the number of users on the 10% outage capacity performance of various user selection criteria. The 
MU-MIMO system configuration consists of Mt — 12 transmit antennas at the base station. Each user is equipped with either 1 
or 2 receive antennas. The proposed criterion refers to the "Selection-Oriented Criterion" in Section III.B.3. 
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Fig. 7. 10% outage capacity performance of the two proposed hybrid user scheduling algorithms (Group Number Minimized 
Algorithm 1 and Degree-of-Freedom Maximized Algorithm 2) and the user grouping algorithm [11]. A {1, 1, 1, 2, 3, 4} x 6 MU- 
MIMO system is considered. "Selection-Oriented Criterion" as presented in Section III.B.3 is applied for the proposed algorithms 
while exhaustive search is applied for [11]. 




Fig. 8. 10% outage capacity performance of the two proposed hybrid user scheduling algorithms (Group Number Minimized 
Algorithm 1 and Degree-of-Freedom Maximized Algorithm 2) and the user grouping algorithm [11]. A {1, 1, 1, 2, 3, 4} x 6 MU- 
MIMO system is considered. All user scheduling algorithms apply the largest principal angle as a user scheduling metric. 



